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Abstract 


A search for a narrow, high-mass resonance decaying into Z and Higgs (H) bosons is 
presented. The final state studied consists of a merged jet pair and a r pair resulting 
from the decays of Z and H bosons, respectively The analysis is based on a data sam¬ 
ple of proton-proton collisions at a center-of-mass energy of 8 TeV, collected with the 
CMS experiment in 2012, and corresponding to an integrated luminosity of 19.7 fb -1 . 
In the resonance mass range of interest, which extends from 0.8 to 2.5 TeV, the Z and 
H bosons are produced with large momenta, which implies that the final products 
of the two quarks or the two r leptons must be detected within a small angular in¬ 
terval. From a combination of all possible decay modes of the r leptons, production 
cross sections in a range between 0.9 and 27.8 fb are excluded at 95% confidence level, 
depending on the resonance mass. 
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1 Introduction 

Very recently, the validity of the standard model (SM) of particle physics has been confirmed 
by the discovery of a Higgs boson with mass near 125 GeV by the ATLAS and CMS experi¬ 
ments nm Though the SM successfully describes a broad range of high energy phenomena, 
the solution to remaining problems with the structure of the SM, particularly the hierarchy 
problem, leads naturally to the introduction of physics beyond the standard model (BSM), pos¬ 
sibly at the TeV scale ||3HEI- Many of the BSM models predict the existence of heavy resonances 
with masses of the order of a TeV, which may have sizable couplings to the gauge and Higgs 
boson fields of the SM H9 UI21 . We consider here one important family among these models, 
which incorporate composite Higgs bosons IfTTl IT21 . In these models, the Higgs boson is a 
pseudo-Nambu-Goldstone boson of a broken global symmetry Other composite bound states 
beyond the Higgs boson are expected to exist and could be experimentally observed. 

Several searches for massive resonances decaying into pairs of vector bosons or Higgs bosons 
have been performed by the ATLAS and CMS experiments fl3H24l . In this analysis, we search 
for a resonance with a mass in the range 0.8-2.5 TeV decaying to ZH, where the Z boson decays 
to qq and the Higgs boson decays to r 1 r . It is assumed that the natural width of the resonance 
is negligible in comparison to the experimental mass resolution, which is between 6% and 10% 
of the mass of the resonance, depending on the mass. There is also a small variation with the 
type of decay channel because of the dependence of the resolution on the number of neutrinos 
in the final state. In the model considered, the spin of the resonance is assumed to be one. 
However, it has been verified that the analysis is insensitive to the angular distributions of the 
decay products and therefore applies to other spin hypotheses. 

The theoretical model used as benchmark in this work is described in Ref. 1251 . In this model 
a heavy SU(2)i vector triplet (HVT) containing neutral (Z') and charged (W ± ) spin-1 states 
is introduced. This scenario is well-motivated in cases where the new physics sector is either 
weakly coupled l26l . or strongly coupled, e.g., in the minimal composite model 112711 . The cross 
sections and branching fractions ( B) for the heavy triplet model depend on the new physics 
scenario under study and can be characterized by three parameters in the phenomenological 
Lagrangian: the strength of the couplings to fermions Cf, to the Higgs Ch, and the self-coupling 
gy. In the case of a strongly coupled sector, the new heavy resonance has larger couplings 
to the W, Z, and H bosons, resulting in larger branching fractions for the diboson final states. 
Our benchmark model characterizes this scenario by choosing the parameters gy = 3 and 
cp = — Ch = 1, which configure a strongly coupled sector. 

In the high-mass case under study, the directions of the particles stemming from Z and H boson 
decays are separated by a small angle. This feature is referred to as the "boosted" regime. For 
the case of Z —>■ qq, this results in the presence of one single reconstructed jet after hadroniza- 
tion called a "Z-jet". The novel feature of this analysis is the reconstruction and selection of a r 
pair in the boosted regime. The presence of missing energy in r decays does not allow a direct 
determination of the invariant mass. 

In the following, we label r decays in a simplified way: r ± —» e : vv as "r e ", r ± —» ft : vv as 
"t h”, and W —» (nn) (niK)v as "Th", where n and rn can be 0,1,2, or 3, and the pions and kaons 
can be either charged or neutral. Six channels, depending on the combinations of r decays, are 
studied separately and labeled as all-leptonic (T e r e , x e x (1 , x„x„), semileptonic (x e Xh, x ?/ Xh), and 
all-hadronic (XhXh). 

The experimental strategy is to reconstruct and identify the two bosons and to combine their 
information into a variable that can discriminate between signal and background and on which 
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a statistical study can be performed. This variable is the estimated mass of the Z' after applying 
dedicated reconstruction techniques to the boosted qq and tt pairs (otzh)- The ;«zh distribu¬ 
tion would show an excess of events at the assumed Z' mass if a signal were present. 

2 CMS detector 

A detailed description of the CMS detector, together with a definition of the coordinate sys¬ 
tem used and the relevant kinematic variables, can be found in Ref. (28). The central feature 
of the CMS detector is a 3.8 T superconducting solenoid of 6 m internal diameter. Within the 
field volume are the silicon tracker, the crystal electromagnetic calorimeter (ECAL), and the 
brass and scintillator hadron calorimeter (HCAL). The muon detectors are located outside the 
solenoid and are installed between the layers of the steel flux-return yoke of the solenoid. In ad¬ 
dition, CMS has extensive forward calorimetry, in particular two steel and quartz-fiber hadron 
forward calorimeters. 


3 Data sample and simulation 

The analysis is based on a data sample collected by the CMS experiment in proton-proton colli¬ 
sions at a center-of-mass energy of 8 TeV in 2012, corresponding to an integrated luminosity of 
19.7 fb 1 . Events are selected online by a trigger that requires the presence of at least one of the 
following: either a hadronic jet reconstructed by the anti -kj algorithm H29l with a distance pa¬ 
rameter of 0.5, transverse momentum pj larger than 320 GeV, and ;/| < 5.0; or a total hadronic 
transverse energy, Hj, defined as the scalar sum of the transverse energy of all the jets of the 
event, larger than 650 GeV. The transverse energy of a jet is defined as the reconstructed energy 
multiplied by the sine of the polar angle of the jet axis. Using events selected by less restric¬ 
tive, pre-scaled triggers, it has been verified that the efficiency of this trigger after applying the 
offline event selection is above 99%. The difference from 100% is considered as a systematic 
uncertainty. 

The process qq —> Z' —> ZH —> qqr + T~ is simulated at parton level using a MadGraph 
5 1.5.11 l30fl implementation of the model described in Ref. l3ll . Seven signal samples are 
generated with masses between 0.8 and 2.5 TeV. For this mass interval, the Z' production 
cross section times branching fraction to ZH ranges from 179.9 fb ( niz> = 0.8 TeV) to 0.339 
fb (niz’ = 2.5 TeV). Although the main sources of background are estimated using observed 
events, Monte Carlo (MC) simulations are used to develop and validate the methods used in 
the analysis. Background samples are generated using MadGraph 5 1.3.30 (Z/y+jets and 
W+jets with leptonic decays), POWHEG 1.0 rl380 (tt and single top quark production) Il32j - |35l , 
and PYTHIA 6.426 11361 (SM diboson production and QCD multijet events with large Hj). Show¬ 
ering and hadronization are performed with PYTHIA and r decays are simulated using TAUOLA 
1.1.5 1571 for all simulated samples. Geant4 If38l is used for the simulation of the CMS detector. 


4 Event reconstruction 

A particle-flow (PF) algorithm 11391 140 ] is used to identify and to reconstruct candidate charged 
hadrons, neutral hadrons, photons, muons, and electrons produced in proton-proton collisions. 
Jets and r h candidates are then reconstructed using the PF candidates. The jet energy scale is 
calibrated through correction factors that depend on the pj and ;/ of the jet. These factors 
were computed using a data set of proton-proton collisions at i/s = 8 TeV, corresponding to 
an integrated luminosity of 19.7 fb -1 , following the method described in |[4TI . All particles 
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reconstructed with the PF algorithm are used to determine the missing transverse momentum, 
p™' ss . In first approximation, pj' ss is defined as the negative vector sum of transverse momenta 
of all reconstructed particles @2). 

Jets are reconstructed using the Cambridge-Aachen (CA) algorithm 03), with a distance pa¬ 
rameter of 0.8, chosen so that it contains the hadronization products of the two quarks from the 
Z boson. Jet pruning and subjet-searching algorithms are applied to these jets as in Ref. Ifl7| . 
In these algorithms the original jets are re-clustered by removing pileup and underlying-event 
particles at low-px arid large angle. The term pileup refers to additional interactions occurring 
in the same LHC bunch crossing. We define m? t as the invariant mass of the jet constituents 
after the pruning procedure. This invariant mass provides good discrimination between Z-jets 
and quark/gluon-jets since it tends to be shifted towards the energy scale at which the jet was 
produced. We also define a quantity called "N-subjettiness", T,y, that is sensitive to the different 
jet substructure characteristics of quark/gluon and Z-jets, as 04): 

\ 

t n = -r Y, Pj,k min(AR u , A R 2 , k , ■■■, AR N , fc ), (1) 

«o k 

where N is the number of subjets in which the original jet can be reclustered with the kj algo¬ 
rithm 051146) : the index k runs over the PF constituents of the jet; pjj ( is the transverse momen¬ 
tum of the /cth constituent; A R thk is a distance defined as V (A’lnjc) 2 + (Acp„ / f ) 2 where A//„ / c and 
A(p, hk are the differences in pseudorapidity and azimuthal angle between the /cth constituent 
and the nth subjet axis; and do = Xjt Pi,k R-o is a normalization factor with Ro equal to the orig¬ 
inal jet distance parameter. The variable Ty quantifies the tendency of a jet to be composed of 
N subjets, having smaller values for jets with a N-subjets-like configuration. We define T21 as 
the ratio between the 2-subjettiness and the 1-subjettiness, T 21 = T 2 /T 1 . The variables m? et and 
T21 have been shown to have a good discrimination power between signal and background 07) . 
therefore in the following they are used to define signal and background enriched regions of 
the analysis. 

In order to match trigger requirements and avoid inefficiencies close to the threshold, at least 
one jet in the event is required to have pj > 400GeV and \rj\ < 2.4. In addition, this jet 
is required to pass minimal consistency requirements on the fraction of charged and neutral 
particles contributing to it, to avoid fake jets from isolated noise patterns in the calorimeters or 
the tracker systems. While the CA jet selection is common to all the channels considered, the 
reconstruction of the rr system is performed differently depending on the r decay channel. 

The all-leptonic channels are identified by combinations of electrons, muons, and p| 11lss , which 
are products of the decay of a pair of r leptons from the Higgs boson. Electrons are recon¬ 
structed by combining the information from an ECAL energy cluster with that of a matching 
track in the silicon tracker 08) . Electrons are selected if they have pj > 10 GeV, |;/| < 2.5, 
and satisfy requirements on the ECAL shower shape, the ratio of energies measured in HCAL 
and ECAL around the electron candidate, the compatibility with the primary vertex of the 
event 09) , and the track-cluster matching parameters. Muon candidates Ii50l are reconstructed 
by performing a global track fit in which the silicon tracker and the muon system information 
is combined. For the channel, to avoid identification inefficiencies caused by the small 
angular separation of the two muon trajectories, the second muon candidate is reconstructed 
with a different algorithm in which tracks in the silicon tracker are matched in space to signals 
in the muon detectors Cl7l . Muons are required to have pj > 10GeV, \)j\ < 2.4 and to pass 
additional requirements on the quality of the track reconstruction, on the impact parameter of 
the track, and on the number of measurements in the tracker and the muon systems. Electron 
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and muon candidates are required to satisfy particle-flow based isolation criteria that require 
low activity in a cone around the lepton, the isolation cone, after the removal of particles due 
to additional interactions. Because the lepton from the other signal r decay in the boosted pair 
can fall in the isolation cone, other electrons and muons are not considered in the computation 
of the isolation criteria. 

In the semileptonic channels, a lepton selected with all the criteria above is combined with a q, 
candidate. The reconstruction of q, starts from the clustering of jets using the anti-/cj algorithm 
with a distance parameter of 0.5. Electrons and muons, identified by looser criteria than the 
nominal ones used in the analysis, are removed from the list of particles used in the clustering 
if they fall within the jet distance parameter. The q, is reconstructed and identified using the 
"hadron-plus-strips" technique 151) , which searches for the most common decay modes of the 
7h starting from charged hadrons and photons forming 7r° candidates. We select ry candidates 
with pj > 20GeV and \rj\ < 2.3. Electrons and muons misidentified as Ty are suppressed 
using dedicated criteria based on the consistency between the measurements in the tracker, the 
calorimeters, and the muon detectors. Finally, loose PF-based isolation criteria are applied to 
the ry candidates, not counting electrons and muons in the cone. 

In the all-hadronic tt channel, a subjet-searching technique H52l is applied to all CA-jets (dis¬ 
tance parameter R = 0.8) in each event to identify the iy candidates. At the next-to-last step 
of the clustering algorithm, there are two subjets, which are ordered by mass. If both have 
pj > 10 GeV and the mass of the leading subjet is smaller than 2/3 of the mass of the original 
merged jet, the two objects are used as seeding jets for r lepton reconstruction via the "hadron- 
plus-strips" technique. If any of the criteria above fail, the procedure for one of the subjets is 
performed again for a maximum of four iterations. The efficiency for finding subjets with this 
method in signal events is 92%, independent of pj, for iy with pj > 40 GeV. In the lowest bin 
investigated (pj between 20 and 40 GeV) the efficiency is around 80%. 

The visible mass, m v is, of the tt system is defined as the invariant mass of all detectable prod¬ 
ucts of the two decays. Because the unobserved neutrinos can carry a significant fraction of 
the tt energy/momenta, this variable is not suited for reconstructing resonances that include 
the tt system among its decay products. Instead, the Secondary Vertex fit (SVFIT) algorithm 
described in [[531 , which combines the pj ,ss with the visible momenta to calculate a more pre¬ 
cise estimator of the kinematics of the parent boson, is used to reconstruct the tt system in all 
search channels. 


5 Background composition 

The composition of the background remaining after reconstruction is different for each of the 
search channels. 

In the T e T e , q,q„ and t ;/ t ?/ channels, the background is almost entirely composed of Z/q+jets 
events with genuine T or other lepton decays. In the qq and qq channels, additional sig¬ 
nificant contributions to the total background come from W+jets and tt events with leptonic 
W-boson decays, and a hadronic jet misidentified as q. Among tt events, those with one W 
boson decaying leptonically and one decaying to quarks can potentially produce a signal-like 
structure in m? t and T 21 . We refer to this as the "tt peaking contribution" in the following. 

The background in the qq channel is dominated by QCD multijets production. There is a 
small but non-negligible contribution from Z+jets, W+jets, and tt production. For all these 
processes, it is possible that genuine q or at least one extra jet or lepton misidentified as q 
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allow the event to pass the selection. 

In all channels there is a very small, irreducible component of genuine SM dibosons, which are 
not distinguishable from signal, except for the non-peaking structure in myy\. 


6 Event selection 

In all channels, the boosted Z boson decaying to qq is identified by requiring the selection: 
70 < m? et < 110 GeV and T 21 < 0.75. This region is referred to as the "signal region". 

In the all-leptonic and semileptonic channels, the rr four-momentum estimated from SVFIT 
is combined with that of the CA-jet to obtain the resonance mass nizn- Several preselection 
requirements are applied to remove backgrounds from low-mass resonances and from overlaps 
of lepton and r lepton reconstruction in the detector: m v \ s > 10 GeV, A> 0.1 (where A R = 
V (A/ 7) 2 + (A cp) 2 and £ denotes electrons, muons, or hadronically decaying taus), |p™' ss > 
20 GeV, and Pt, T t > 100 GeV, as estimated from the SVFIT procedure. 

Since the background in the all-hadronic channel is initially dominated by QCD multijet events, 
a different preselection is applied for the all-hadronic channel. Only events that have not 
been included in the all-leptonic or semileptonic categories are considered in this category. 
The event is then separated into two hemispheres containing the decay products of the two 
bosons by requiring the following preselection: |p™ iss | > 40GeV, |A<J>(CA-jet, Th)| > 2.0 and 
|A<Kp.T s , Th) | <1.5, for each of the two ip candidates. 

Further criteria investigated for signal selection in all channels include tighter requirements 
on variables like the pj of the highest-p? (leading) lepton or and rn TT as estimated from the 
SVFIT procedure. An upper limit is placed on Ain order to reject W+jets events, where a 
jet misidentified as a t lepton is usually well-separated in space from the isolated lepton. The 
number of b jets in the event also provides a useful criterion to reduce the tt contribution. Jets 
may be identified as b jets, using the combined secondary vertex algorithm |54| which exploits 
observables related to the long lifetime of b hadrons, and are considered if not overlapping 
with r candidates and CA-jets. Those b jets are clustered with the anti-fcj jet algorithm, with 
a distance parameter R = 0.5. Optimization of the selection on these variables is based on the 
Punzi factor of merit (V) If55ll , defined as: V = £ S i g / (1 + VB), where £ s i g is the signal efficiency 
and B is the background yield after applying the selection. The results of the optimization 
are listed in Table Q] It has been verified that these results are not sensitive to the choice of 
m ZH window used to evaluate £ S i g and B. In Table [2] we show the efficiency of the selection in 
signal events for all search channels. 

Table 1: Summary of the optimized event selection for the six tt channels. The selection vari¬ 
ables are explained in the text. The label £ refers to electrons, muons, and t leptons decaying 
hadronically. 


Selection 

T e T e , T e T|„ T^T u 

Tshv IjiTK 

TiTh 

\Pt 1ss \ 

>100 GeV 

>50 GeV 

>80 GeV 

leading 

Pt,c 

— 

>35 GeV 

>50 GeV 

hib-tagged jet 

A-R« 

= 0 
<1.0 

= 0 
<1.0 

<1.0 

m TT 

— 

— 

105-180GeV 
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7 Background estimation 

Because of the non-uniformity of the background composition, different estimation techniques 
are used in each channel. 

In the T e T e , T e T„, and Typ, channels the main background source lacks events with a genuine 
massive boson decaying to quarks, therefore a technique based on sidebands of the mb and 
T 21 variables is used for background estimation. In an enlarged search region defined by 
m? et > 20GeV, we define the "sideband region", inverting the selections on m? t and T 21 , there¬ 
fore including both m? t regions outside the signal range and regions with T 21 > 0.75. 

The total background is estimated in intervals of m zh/ using the formula: 

Nbkg(x) = Af N sh {x ) a(x), (2) 

where x = Af is a normalization factor, N s b(x) is the number of events observed in the 
sideband region, in bins of rriyu, and cc(x) is a binned ratio between the shapes of the /»/h distri¬ 
butions in the signal and sideband region, taken from the sum of MC components. The normal¬ 
ization factor is found through a fit of the observed pruned jet mass distribution, following the 
procedure used in Ref. il7l . The pruned jet mass distribution in the region 20 < m? ei < 200 GeV, 
T 21 < 0.75 is fit in MC samples with the following function: 

F(x) = Af e ax (1 + erf[(x — b)/c]), (3) 

where "erf" is the error function and the parameters a, b and c are estimated from the MC 
simulation. A fit to the observed distribution, excluding the signal region, is then used to 
determine Af. Figure [I] shows the observed distributions of 7 «zh in all-leptonic channels, along 
with the corresponding MC expectations for signal and background, as well as the background 
estimation derived with the above procedure. 

In the semileptonic channels, a control sample defined by the preselection described before, 
but requiring at least one b-tagged jet, is selected. It has been established with simulation that 
more than 95% of this sample is composed of tt events. Two scale factors (SFs) relating the ratio 
of the observed to simulated event rates, one for the tt peaking contribution and the other for 
the tt combinatorial background, are estimated from this control sample. The pruned jet mass 
distribution is fit with the sum of two functions: 

F t j(x) = N(non-peaking) e Ax (l +erf[(x — B)/C]) + N(peaking) Q(D,E) (4) 

where A, B, and C define the shape of the non-peaking component, analogous to Eq. (|3j, and 
Q(D,E ) is a Gaussian function of mean D and standard deviation E. The values of these 
two parameters are fixed to those found in the analysis searching for vector boson pair res¬ 
onances fl7l because we are using the same Z-jet reconstruction. From this fit, the two scale 
factors between data and MC are found, one for each contribution: r FF = N (peakingj dala / 
N(peaking )mc and rf F = N(non-peaking) data /N(non-peaking)MC- The same procedure as for 
the all-leptonic channels is then applied, fitting the observed sideband distribution but using 
a modified function, given by the sum of the tt contribution and the function of Eq. (J3|), where 
the tt normalization is fixed at the MC expectation, scaled by the two SFs. Figured shows the 
distributions of niyn in semileptonic channels, along with the corresponding MC expectations 
and the background estimation derived with the above procedure. 

For each of the methods used, consistency checks comparing data and background predictions 
are performed using samples of events at the preselection level, that are expected to have small 
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contributions from potential signal resonances. In the case of the semileptonic channels, we 
show in Fig.|3jthe distribution of m? et for data and MC at the preselection level. The black line, 
representing the fit to data, is obtained by the sum of Eqs. ^ and (Q, with the tt shape as 
obtained from the control sample, the tt normalization is fixed to MC scaled by the two SFs, 
and the other components are free in the sideband fit. An overall agreement between data and 
prediction is observed. The background prediction in the signal region is 156 ± 26 events, with 
an observation of 151 events, for the T e T b channel and 204 ± 31 events, with an observation of 
203 events, for the Tyry channel. 

In the all-hadronic channel, for events where the leading jet satisfies the requirement T 21 < 0.75, 
a plane is defined using the and m TT variables and four regions are considered, as shown in 
Fig. [4] Most of the signal events are expected in region A, while regions B, C, D are dominated 
by background events. Studies of the correlation factors for simulated events and in regions or¬ 
thogonal to the signal region show that the variables m? et and m TT are essentially uncorrelated. 
In this case, the total number of background events in the region A can be estimated as: 

N bkg = (N b N d )/Nc. (5) 

The method described by Eq. |5]), called "ABCD method", gives a background prediction in the 
signal region that has been checked to be insensitive to possible signal contamination in the 
regions B, C, D. 

Figure [5] shows the observed distributions of ;«zh in the T b T b channel, along with the corre¬ 
sponding MC expectations for signal and background. The low number of events in regions 
B, C, D is not sufficient to derive the shape of the distribution in the signal region using the 
ABCD method. We use the results from this method to compute the cross section upper lim¬ 
its, which are obtained without assumptions about the shape of the distributions. The ABCD 
method is checked using an alternative background estimation technique, where tt, W+jets and 
Z+jets background contributions are given by Eq. (j2jl, while the QCD multijet background is 
estimated from a control sample of events where at least one r candidate fails the isolation re¬ 
quirement. The same control sample is used to obtain the shape of the QCD distribution in the 
signal region presented in Fig. [5] 

8 Systematic uncertainties 

The sources of systematic uncertainty in this analysis, which affect either the background esti¬ 
mation or the signal efficiencies, are described below. 

For the signal efficiency, the main uncertainties come from the limited number of signal MC 
events (3-10%), the integrated luminosity (2.5%) II56H , and the uncertainty on the modeling of 
pileup (0.2-2.2%). Hereafter, the ranges indicate the different channels and mass regions used 
in the evaluation of the upper limits. The scale factors for lepton identification are derived from 
dedicated analyses of observed and simulated Z —y i + i~ events, using the "tag-and-probe" 
method 11501 [51] 1571 . The uncertainties in these factors are taken as systematic uncertainties and 
amount to 1-4% for electrons, 1-6% for muons and 9-26% for r leptons decaying hadronically. 
The jet and lepton four-momenta are varied over a range given by the energy scale and resolu¬ 
tion uncertainties fill . In this process, variations in the lepton and jet four-momenta are prop¬ 
agated consistently to pjF lss . For the all-leptonic and semileptonic channels, additional uncer¬ 
tainties come from the procedure of removing nearby tracks and leptons used in the hadronic r 
reconstruction, and from the isolation variable computation in the case of boosted topologies. 
The inefficiency resulting from these procedures, as measured in signal simulation, is assigned 
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sponding MC expectations for signal and background: (left) T c T h category; (right) T„Th category. 
Fits are performed for MC and data (as discussed in the text). 
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Figure 4: Definitions of the A, B, C, and D regions in the m? t / m TT plane used in the background 
estimation for the all-hadronic channel. 
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Figure 5: Observed distributions of mzzh for the ThT|, category along with the corresponding 
MC expectations for signal and background. Ten equal-size histogram bins cover the region 
from 0 to 2.5 TeV, while a single bin is used at higher wizh because of the limited number of MC 
and data events. The signal cross section is scaled by a factor of 5. 
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9 Results 


as a systematic uncertainty, corresponding to 1-16% for r reconstruction and 1-21% for isola¬ 
tion. In the all-hadronic analysis, a constant uncertainty of 10% is assigned for the application 
of the r reconstruction procedure to collimated subjets, comparing the performance for iso¬ 
lated and non-isolated r leptons in simulation. The jet trigger efficiency has an uncertainty 
of <1%, as determined from a less selective trigger. Following the method derived for vector 
boson identification in merged jets |[58ll , a scale factor of 0.94 ± 0.06 is used for the efficiency 
of the pruning and subjet searching techniques applied on the CA jet, where the uncertainty is 
included in the estimation of the overall systematic uncertainty. For the b tagging, data-to-MC 
corrections derived from several control samples are applied and the uncertainties on these 
corrections are propagated as systematic uncertainties in the analysis (2-6%). The procedure 
used to derive the b-tagging systematic uncertainties is described in Ref. Il54ll . 

The uncertainties in the background estimate are dominated by the limited numbers of MC 
events and sideband data events (4-16 events in all-leptonic channels, 34-37 events in semilep- 
tonic channels and 29 in the all-hadronic channels). In the analysis of the all-leptonic and 
semileptonic channels, additional uncertainties in the background yields of 10-96% originate 
from the limited number of events of the background MC samples used in the computation of 
the a{x) quantity, and 18-47% from the normalization fit. 


9 Results 

Table [2] shows the signal efficiencies (computed using a sample generated with corresponding 
r decays), the background expectation and the number of observed events for the six analysis 
channels. 

Having observed no significant deviations in the observed number of events from the expected 
background, we set upper limits on the production cross section of a new resonance in the ZH 
final state. We use the CL S criterion 1 59 1 |6(J| to extract upper bounds on the cross section, 
combining all six event categories. The test statistic is a profile likelihood ratio Ii6ll and the 
systematic uncertainties are treated as nuisance parameters with the frequentist approach. The 
nuisance parameters are described with log-normal prior probability distribution functions, 
except for those related to the extrapolation from sideband events, which are expected to fol¬ 
low a T distribution B6lH . In the all-leptonic and semileptonic channels, the numbers of signal 
and background events are calculated for a region corresponding to ±2.5 times the expected 
resolution around each mass point in mzH/ while in the all-hadronic channel we consider the 
number of expected background, signal and observed events in viz H > 800 GeV for each mass 
point. The expected and observed upper limits are shown in Fig. [6] Production cross sections 
times branching fraction in a range between 0.9 and 27.8 fb, depending on the resonance mass 
(0.8-2.5 TeV), are excluded at a 95% confidence level. 

In Fig. [6j the results from this analysis are also compared to the cross section of the theoretical 
model, used as benchmark in this paper and studied in Ref. (25). In this model, the parameters 
are chosen to be gv = 3 and Cp = —c h = 1, corresponding to a strongly coupled sector. In Fig.[7j 
a scan of the coupling parameters and the corresponding regions of exclusion in the HVT model 
are shown. The parameters are defined as gyc h and g 2 c?/gy, related to the coupling strength 
of the new resonance to the Higgs boson and to fermions. Regions of the plane excluded by 
this search are indicated by hatched areas. Ranges of the scan are limited by the assumption 
that the new resonance is narrow. 
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Table 2: Summary of the signal efficiencies, number of expected background events, and num¬ 
ber of observed events for the six rr channels. Only statistical uncertainties are included. For 
the all-leptonic and semileptonic channels, numbers of expected background events and ob¬ 
served events are evaluated for each mass point in mzn intervals corresponding to ±2.5 times 
the expected resolution. For the all-hadronic channel we consider the number of expected 
background, signal, and observed events for nizH > 800 GeV. When the expected background 
is zero, the 68% confidence level upper limit is listed. 


Mass (TeV) 

T e T e 

T e Tf, 


TeT h 

% T h 

ThT h 

B{ rr) 

3.2% 

6.2% 

3.0% 

23.1% 

22.6% 

41.9% 

e sig (To) 

0.8 

2.8 ± 0.7 

3.4 ± 0.5 

4.2 ± 0.7 

3.3 ± 0.3 

4.4 ± 0.3 

2.2 ± 0.2 

0.9 

11 ± 1 

16 ± 1 

20 ±2 

14.3 ± 0.5 

18.7 ± 0.6 

11.5 ± 0.4 


1.0 

17 ±2 

24 ± 1 

38 ±2 

21.2 ± 0.6 

29.3 ± 0.7 

18.0 ± 0.5 


1.2 

26 ±2 

30 ± 1 

39 ±2 

28.3 ± 0.7 

35.8 ± 0.7 

23.0 ± 0.5 
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30 ±2 

42 ± 2 

53 ±2 

29.2 ± 0.8 

38.1 ± 0.9 
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39 ±2 

56 ±3 

31.1 ± 0.8 

39.2 ± 0.9 

31.9 ± 0.7 
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27 ±2 

37 ± 2 

42 ±2 

26.8 ± 0.8 

37.0 ± 0.8 

30.1 ± 0.7 

^bkg 

0.8 

0.3 ± 0.5 

1.1 ± 0.8 

1.6 ±1.2 

6.1 ± 2.0 

6.7 ±2.1 
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1.7 ± 1.2 

3.8 ± 2.1 

9.8 ± 3.2 

9.2 ± 2.9 
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Figure 6: Expected and observed upper limits on the quantity cr(Z') B(Z' —? ZH) for the six 
analysis channels combined. Green and yellow bands correspond to ±1 or ±2(7 variations on 
the expected upper limit, respectively. 

10 Summary 

A search for a highly massive (>0.8 TeV) and narrow resonance decaying to Z and H bosons 
that decay in turn to merged dijet and t ! t final states has been conducted with data samples 
collected in 8 TeV proton-proton collisions by the CMS experiment in 2012. For a high-mass 
resonance decaying to much lighter Z and H bosons, the final state particles must be detected 
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CMS 19.7 fb' 1 (8 TeV) 



Figure 7: Exclusion regions in the plane of the HVT-model coupling constants g 2 Cv/gw) 

for two resonance masses, 1.0 and 1.5 TeV. The point B of the benchmark model used in the 
analysis, corresponding to gw = 3 and Cf = — Ch = 1, is also shown. The boundaries of the 
regions of the plane excluded by this search are indicated by the dashed and dotted lines, and 
associated hatching. The areas indicated by the solid line and solid shading correspond to 
regions where the theoretical width is larger than the experimental resolution of the present 
search and thus the narrow-resonance assumption is not satisfied. 

and reconstructed in small angular regions. This is the first search performed by adopting 
novel and advanced reconstruction techniques to accomplish that end. From a combination of 
all possible decay modes of the r leptons, production cross sections in a range between 0.9 and 
27.8 fb, depending on the resonance mass (0.8-2.5 TeV), are excluded at a 95% confidence level. 
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